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Abstract 

We study the rate of convergence to a normal random variable of the real and 
imaginary parts of Tr A^U, where U is an N x N random unitary matrix and An is 
a deterministic complex matrix. We show that the rate of convergence is 0(N- 2 + b ), 
with < b < 1, depending only on the asymptotic behaviour of the singular values 
of A jy; for example, if the singular values are non-degenerate, different from zero 
and 0(1) as N —> oo, then b = 0. The proof uses a Berry-Esseen inequality for 
linear combinations of eigenvalues of random unitary matrices, and so appropriate 
for strongly dependent random variables. 
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1 Introduction 

The value distributions of traces of random unitary matrices have been studied extensively 
over the past fifteen years [HI E31 EE EH EH [EH HD E5] • The main reason is that they are 
connected with the linear statistics 

S N (x)--=x{e i9l ) + --- + x(e l9N )i (1.1) 

where \ is a suitable test function and e ldl , . . . , e ldN are the eigenvalues of N x N unitary 
matrices U distributed according to Haar measure. It turns out that in many applications in 
particle physics, open quantum systems, quantum chromodynamics and scattering theory 
it is interesting to understand the asymptotic (N — » oo) behaviour not only of Tr U but of 
the more general random variable 

Z N ■= V N + iW N = Tr A N U, (1.2) 

where Vn (respectively Wn) is the real (imaginary) part of Z^, and An is a deterministic 
complex matrix. (See, e.g., [221 12| [3| 123] and references therein.) In other words, we want 
to understand the distribution of linear combinations of the elements of random unitary 
matrices. In general, this type of question arises when Random Matrix Theory is applied 
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to non-Hermitian quantum mechanics, an area of physics which has grown rapidly in the 
last decades (see, e.g., [T9| 120] and references therein). As we shall see, the invariance of 
Haar measure on XJ(N) under group action implies that the distributions of Vn and Wn 
are the same. Therefore, we shall restrict our attention to Vn- 

Samuel [22J and Bars j2] computed the first few terms in the cumulant expansion of Vn, 
which implicitly show that it converges in distribution to a normal random variable when 
iV — > oo. D'Aristotile et al. [TO] gave a rigorous proof of this result. Collins and Stolz [6] 
proved a multivariate version of this theorem: they showed that a vector of the form 



where r is independent of N, converges to a joint normal distribution. 

In her PhD thesis, Meckes [TTI [18] studied the rate of convergence of Vn to a central 
limit theorem using Stein's method of exchangeable pairs. Let us normalise An so that 
TyA n A* n = N, where A* N is the conjugate transpose of A N , and denote by Af(fi,cr 2 ) a 
normal random variable with mean /i and variance a 2 . Meckes proved that the distance 
of Vn to A/"(0, 1/2) in the total variation metric on probability measures is bounded by 
cnN~ 1 , where cn is asymptotic to 2v2. Chatterjee and Meckes [5] obtained a rate of order 
0(N~ l ) in the multivariate setting too, and showed that the constant is linear in r. 

The bound computed by Meckes holds for any An e C NxN , subject to the constraint 
Tr A N A* N = N. However, given a fixed sequence {A N } N>1 , it is natural to ask how the 
rate of convergence of Vn depends on ijy. The purpose of this paper is to show that this 
rate is O (iV~ 2+fc ), where < b < 1, depending only on the leading order asymptotics as 
iV — > oo of the greatest singular value of An. For example, if the elements of An do not 
grow with N — which is what one would expect for a generic sequence {An} n >i — then 
6 = and the rate of convergence is O (N~ 2 ). When 6=1 only a finite fraction of the 
singular values is different from zero in the limit iV — > oo. For technical reasons, which 
we will discuss in section l3~2] we exclude the case 6 = 1. Meckes' bound CjviV -1 does not 
discern the dependence of the rate of convergence on the singular values of An, and our 
result implies that it is sharp only when 6=1. 

Our approach is based on the method of moments, which allows us to prove a Berry- 
Esseen inequality for the eigenvalues of random unitary matrices. In general, Berry-Esseen 
bounds are used to prove central limit theorems for sums of independent or weakly depen- 
dent random variables. It is notable that such a bound exists for sums of eigenvalues of 
matrices in XJ(N), which are strongly correlated. 

When An is the identity, then Zn is a class function and the underlying group structure 
of U(iV) can be exploited. For general An these group-theoretical tools are not available. 
There is a considerable literature addressing the problem of the distribution of Tr W , where 
j G Z+. Diaconis and Shahshahani [9], and independently Haake et al. [13], proved that 
it convergences in distribution to y/]Z, where Z is a standard normal complex random 
variable. Diaconis and Shahshahani's proof is based on the method of moments; they 
showed that the k-th moments of Tr / \fj are exactly Gaussian for k < N. This property 




(1.3) 
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prompted Diaconis to conjecture that the convergence to a normal random variable is very 
fast, either exponential or even superexponential. Consider the error 

e(N) :=sup\F N (x) -Q(x)\ , (1.4) 

where 



:= -= / e~~dt (1.5) 

V27T J-r 



oc 



and F N (x) is the distribution function of i/2/j Re Tr [P, i.e. 

Ftv(x) := / /at (t) dt, (1.6) 



where /jv is the probability density function (p.d.f.). Johannson [T5] proved that e(N) = 
O t (N~ tN ). He also showed that the distance of y/2/j Re Tr U j to A/"(0, 1) in the total 
variation norm is of the same order. Such a rate of convergence to a central limit theorem 
is unusual in probability theory. The approach that we use to achieve our bounds also 
sheds light on why the convergence of TrCP is so fast. 

Subsequently, many authors have refined or improved Diaconis and Shahshahani's re- 
sults. Soshnikov [21] showed that the linear statistics (11. ip converge in distribution to a 
normal random variable in the mesoscopic regime too, i.e. if one considers eigenvalues in 
an arc of length L N with L N /N — > as iV — > oo. Hughes and Rudnick [TJ] studied the 
scaling limit = N . It turns out that the number of moments of SV(x) that are exactly 
Gaussian depends on the class of test functions considered. Diaconis and Evans [UJ used 
the results in [9j to study the asymptotic distributions of integrals of the type L fdE, 
where S is the unit circle and is the random point measure that places a unit mass at 
each eigenvalue e j . Pastur and Vasilchuk [2TJ and Stolz [25] gave alternative proofs of the 
convergence to normal random variables of Tr [P . 

This article is structured as follows. In §2] we discuss the background of the problem 
and introduce our main results. The moments and cumulants of Vn can be computed using 
the character theory of the symmetric group; these calculations are detailed in ^3]. In §|4] 
we present the proof of the Berry-Esseen inequality. Finally, ^5] and §|6] are devoted to the 
proofs of the main theorems. 



2 Statement of results 



2.1 Preliminaries 

Let us introduce the random variables 



X N := (ReTr A N U) / a = V N / a and Y N := (ImTr A N U) /a = W N /a, (2.1) 
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where U is an N x N unitary matrix distributed according to Haar measure and 

2 . Tr A N A* N 
° ■= — 2JT - - (2-2) 

The matrices in a given sequence {A/v}jv>i can be normalised so that a 2 is independent of 
N. 

Using the polar decomposition we can factorize A N in the product 

An = H N V, (2.3) 

where V G U(iV) and if^y = v / ^^n ^ s positive-semidefinite. Let us also write U = 
WQW*, where W e U(JV) and 9 = diag (e i91 , . . . , e iejv ). Since Haar measure is invariant 
under group action, the random variable Tr HnU / a has the same distribution as Tr AnU / a. 
Thus, without loss of generality, we can restrict An to the set of positive-semidefinite 
matrices. Furthermore, we have 

N 

Tr AnU = Tr A N WQW* = Tr W*A N WQ = Tr A N & = a^a^ , (2.4) 

3=1 

where An is Hermitian positive-semidefinite too and aaj > are its diagonal elements. 
Therefore, we can write 

Xn = ai cos 9i + ■ ■ ■ + aN cos 9n, (2.5a) 
Yn = a i sin9i + ■ ■ ■ + cin sin 9n- (2.5b) 

Since Haar measure is invariant under translation, Xn and Yn have the same probability 
distribution. Thus, we shall restrict our attention to Xn- 
The characteristic function of Xn is defined by 

MO :=Eo W {e^}. (2.6) 
It admits a representation as an integral over the unitary group. We have 

MO = ^ (iv) ex P \J^(^ A N U + ^ A %U*)) du. K (U), (2.7) 

where d/j,u denotes Haar measure over U(iV). When A N is not singular, such an integral 
can be evaluated explicitly [I] (see also [23] when the matrix in the second trace is different 
from A* N ): 

mo - [ T ) [U ,i j n^H-^) ' (28) 

where V\,...,vn are the singular values of An and </& is the Bessel function of the first kind. 
Unfortunately, this beautiful formula is not the best starting point for a straightforward 
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asymptotic analysis. In order to determine the rate of convergence of Xn, we will need to 
control iPn{0 when £ grows like a power of N. This means that N appears as a parameter 
in both the argument and the index of the Bessel functions. The facts that the asymptotic 
limit of Jk{x) as x — >• oo is not uniform in the index, and that all the Bessel functions from 
Jo to Jjv-i appear in the determinant, render the analysis of formula (12. 8 p difficult. 

Damgaard and Splittorff [7] computed the first few terms of the asymptotic expansions 
of integral ( 12. 7p for "low-mass" and "large-mass" . In our formalism, this means in the limit 
as £ — > and £ — > oo. 

The approach that we adopt is based on the method of moments, which can be com- 
puted explicitly up to the 2iV-th for any matrix An, whether singular or not. The only 
constraint that we impose on the sequence {An} n >i is the normalisation (12.21) . 

Our results will depend on the asymptotic properties of the singular values of An. 
Therefore, we need to introduce quantities that characterise their behaviour in the limit as 
N — > oo. Let us order the singular values of A N so that v\ < ■ ■ • < Un and let u N — O (N b ) . 
Then, define 

k := inf {c G E| u N < cN\ ViV > l} . (2.9) 

Since b is optimal, the normalization (12. 2p implies that < b < 1 and k > 0. The meaning 
of b and k can be illustrated with a few examples. If all the matrix elements of An are 
0(1) as N — > oo, then 6 = 0. Alternatively, consider the sequence of matrices 

A N = diag (y/2N, 0, . . . , oV (2. 10) 

Then 6 = 1 and k = 2. In other words, b not only gives the rate of growth of Un, but also 
measures how sparse the set of singular values is in the limit iV — > oo. 



2.2 Rates of convergence 

Using the same notation as in §TJ Fn and $ will denote the distribution functions of Xn and 
of a standard normal random variable respectively; similarly, Jn is the p.d.f. Furthermore, 
we shall write 



x 2 /2 



/oo 
e^ x (j)(x)dx = e~ e/2 . (2.11) 
-oo 



-oo 

r 2 



Theorem 2.1. Suppose {An} n> i ^ a sequence of matrices such that a = Tr AnA* n /(2N) 
is independent of N and that < b < 1 . We have 

e(N) :=sup\F N (x) -<S>(x)\=0(N- 2+b ) , N -»■ oo. (2.12) 

As we shall see, the power of minus two in (12.121) is determined by the Haar measure 
on U(iV). The sequence {An} n> i influences the rate of convergence only through the 
parameter b, which is a measure of the asymptotic distribution of the singular values of 
the matrices An- 

We can prove an analogous statement in the total variation norm. 
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Theorem 2.2. Let {An} n>1 be a sequence satisfying the same conditions as in theo- 
rem 12. 11 We have 

/oo 
\f N {x) - <p(x)\dx = O e (N- 2+b+e ) , N ->■ oo, (2.13) 
-oo 

where e G (0, |(1 - 6)). 

As we discussed in the introduction, for technical reasons theorems 12.11 and 12.21 exclude 
6 = 1. Meckes' [H] result suggests that they are correct for b = 1 too. 



The starting formula to prove theorems 12.11 and 12.21 is 



i f 

e(N)<- J 

(see [12J, p. 538), where 



« + ^ (2.14) 

7rT N 



m := max |0(x)| = (2-15) 

and T/v is an appropriate cut-off. Formula (I2.14p transfers the problem of computing e(N) 
into that of finding a bound for \%fj N (£) — ^ (01 ^ or sufficiently large ^. 

Theorem 2.3. Let C and 5 be two constants independent of N and let < |£| < 6N^-~ b ^' 2 . 
We have 

\M0-m\<§^r b e- e/2 - (2-16) 

Throughout this paper C will denote a constant, which may be different at each occur- 
rence. 

Remark 2.4. Theorem 12.31 is of interest in its own right. Such bounds are called Berry- 
Esseen inequalities. They determine rates of convergence to central limit theorems, 
usually for sums of independent or weakly dependent random variables. The eigenvalues of 
random unitary matrices, however, exhibit a high degree of correlation. 

For eigenvalues of random unitary matrices, one consequence of such a strong depen- 
dence is that the variance a 2 = Tr AnA* n /(2N) remains finite in the limit N — > oo. Instead, 
the variance of the sum of N independent random variables grows linearly in N. When the 
moments diverge in the limit N — > oo, just the first few are enough to determine an opti- 
mal bound. Since the right-hand sides of equations fl2.5ajl and (12.5bj) converge to normal 
random variables without any normalisation, the proof of theorem \2.3\ requires knowing the 
first 2N moments of X^. 



3 Moments and cumulants of 

The purpose of this section is to provide bounds and asymptotic formulae for the moments 
and cumulants of Xn that will be needed to prove theorem 12.31 Most of these can be 
derived from the results of Samuel [22] , which we summarise in §3.11 
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3.1 Averages of matrix elements and the symmetric group 

Samuel [22] studied averages of the form 

U(AT) 

= M *A N ) S hk«i ■ ■ ■ 5i m k„Jhin ■ ■ ■ S jm i Tm , (3.1) 

o-,r€6 m 

where & m denotes the symmetric group of degree m. The moments of Xn are simply linear 
combinations of these integrals. 

All the information on the averages (13. ip is contained in the coefficients M a ^ T (N). A 
permutation of m letters can always be factorised in a product of cycles. It turns out that 
M ajT (N) depends only on the cycle decomposition of crr^ 1 . 

The lengths of the cycles of a permutation identify a sequence of non-negative integers 
A = (Ai, . . . , Xk) such that 

Ai > • • • > Afe and |A| :— X± + ■ ■ ■ + X k — m. (3.2) 

In other words, there exists a one-to-one correspondence between the cycle structures of 
& m and the set of partitions of m. The partition X(g) is called cycle-type of g G @ m . 
Therefore, we shall adopt the notation 

M X (N) :=M UtT (N), (3.3) 

where A is the cycle- type of ar^ 1 . 

A partition of m is denoted by A h m; the addends Xj are the parts of A. An alternative 
notation for a partition is the frequency representation: if A contains r± Is, r 2 2s and so 
forth, we write A = (l ri 2 r2 . . . m rm ). The length of a partition £(X) is the largest j such 
that Xj > 0. We also have 

£(\) = ri + ... + r m . (3.4) 

We shall find it convenient not to distinguish between two partitions that differ only by a 
sequence of zeros at the end. For example, (3, 1, 1) and (3, 1, 1, 0, 0, 0) are clearly the same 
partition. 

Elements of & m that belong to the same conjugacy class share the same cycle-type. 
Therefore, the conjugacy classes of & m can be labelled by the set of the partitions of m. 
The number of elements in the conjugacy class A is 

9x ■= f (3-5) 

l ri ri! • ■ ■ m rm r rn \ 

Furthermore, the conjugacy classes of & m are in one-to-one correspondence with its irre- 
ducible representations, which can be identified with the set of partitions of m too. Since 
characters are class functions they depend only on the cycle-types of the permutations. 



8 



J. P. Keating, F. Mezzadri and B. Singphu 



The notation X\ indicates the character of the irreducible representation /i evaluated on 
elements of cycle-type A. 

Sometimes it is convenient to represent partitions using Young tableaux. If A = 
(Ai, . . . , Afc), we draw k left-justified rows of boxes, or nodes; the top row should contain 
Ai boxes, the next one A2 and so on. For example, let A = (5,4,4,3, 1). Then, 



is the corresponding Young tableau. 

Samuel [22] derived an explicit formula for M\ (N) when m < N: 



M X (N) :-- 



dim V, 



-y 



fJ- 11 

-Xx, 



A h m, 



(3.6) 



fj,\—m 



where 



and 



fx(N) :- 



dim V\ 



fihm 



dim V\ 



mi 



. Tii<i<j<e(\) 



(Ai - Xj+j - i) 



n£!(Ai+*(A) 



j) 



(3.7) 



(3. 



is the dimension of the irreducible representation V\. 

The right-hand side of (13. 7p is polynomial in iV of degree m. It turns out that f\(N) 
has only integer roots, which have a simple representation in terms of the Young tableau 
of A; they are given by all the differences % — j, where i counts the rows of the diagram 
in descending order and j counts its columns from left to right. For example, if A = 
(5, 4, 4, 3, 1), then the roots of f\(N) are 






- 1 


-2 


-3 


-4 


1 





-1 


-2 




2 


1 





-1 




3 


2 


1 




4 









We shall give a proof of this property later in this section. 

Since the characters of the irreducible representations of the symmetric group are know 
via Frobenius's character formula, equations (13. 5p . (13. 6p and (13. 7p completely determine 
the averages (13. ip . 
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Let Ai, . . . , Afc be the parts of a partition Ahm. (We do not impose any ordering on 
the XjS.) The coefficients M\(N) obey the recursion relations [22] 

(TV) 

p+q=\ k 

k-i (3.9) 

+ A i M (Ali ... iA ._ liA . +Afc ,A,- +1 ,...,Afc -i) ( N ) > 

3=1 

with initial condition M Q (N) = 1. These equations do not depend on permutations of the 
AjS and are a complete set, which uniquely determines the coefficients M\(N) for A h m 
in terms of those for Ahra-1. 

Traces of powers of matrices are homogeneous symmetric polynomials in the eigen- 
values. Symmetric functions are intertwined with the character theory of the symmetric 
group. Therefore, it is not a surprise that the formalism of symmetric polynomials will 
become useful in computing the moments and cumulants of X^. 

For every j the power sum of m variables is 

Pj(xi, . . . , x m ) := x{ H Yx ] m . (3.10) 

Next, we extend the definition f)3.10p by taking the product 



m 



P\ ■=P\ 1 •••Pa* = ( 3 ' n ) 

3=1 

where the rjs are the frequencies of A. Now suppose that £(X) < m. The Schur function 
s\(xi, . . . , x m ) is defined by the ratio of two m x m determinants: 

det (x^ +m - j 



s A (zi,...,z m ) := d \ x m-j\ ■ ( 3 - 12 ) 

Schur functions are homogeneous symmetric polynomials of degree |A| and are related to 
the power sums by the formulae (see [IB], p. 114) 

s\ = —. duX^Pn and p^ = ^ xJ*a- (3-13) 

/ihm Ahm 

If xi, . . . , x m are the eigenvalues of a m x m matrix X we write p\ (X) = p A (xi, . . . , x m ) 
and s x (X) = s x (xx, x m ). 

Corollary 3.1. The polynomial ( 13. 7p can be factorised aJ^\ 

/»w = e ^ iv ' M = n <* - + a = n ( ,s-7y • (3 - l4) 

where the pair span the row and column indices of the Young tableau of X. 



1 Samuel conjectured this formula in the appendix of his article |22) . but did not give a proof. 
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Proof. Let A h m and N be a positive integer, then px(In) — N e ( x \ Therefore, from 
formula fl3TT3|) for N eZ+ 

/ A (iV)=m!^lM. (3.15) 
dim V\ 

The irreducible representations of the symmetric group and of GL (N, C) are related by the 
Schur-Weyl duality. If N > £(X) the Schur functions are precisely the irreducible characters 
of GL(iV, C). Thus, equation (I3.15P connects the dimensions of irreducible representations 
of & m and GL (N, C) corresponding to the same A. Now, we have 



l<j<k<N J 



Combining this formula with ( 13 . 15[) and ( 13. 8 j) gives equation ( 13. 14[) . □ 
3.2 The moments 

Formula ( 12. 7p implies that ipN is an entire function. Therefore, the series 

M0 = J2^rVn (3.i7) 

71=0 

converges in all the complex plane and defines all the moments of X^, which identify its 
probability distribution uniquely. 

Now, consider the Taylor expansion of the integral (12.71) : 

= E^ (§)"£ (I) / (™r™(w*r *„(f). (3.i8) 

n=0 v 7 m=0 v / ^ U (,7V j 



Since Haar measure is left and right invariant, the integral in this sum is zero unless n = 2m. 
Therefore, 

« = E^r (£) (3.19) 

it) — n v * / v / 



m=0 

where 



I%(A N ):= [ \Tr A N U\ 2m du. H (U). (3.20) 

•/u(JV) 

Thus, the moments of Xn are given by the formula 

^ 2m = ml (2cx 2 )™ J ^ ' ( ^ 3 ' 21 ' ) 

Proposition 3.2 (Samuel 1980). Let m < N and let A = (F 1 . . . m rm ) denote a partition 
of m. We have 

I™(A N ) = m\J2 9xM x (N) p x (A N A* N ) . (3.22) 

Ahm 
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Proof. The right-hand side of equation (13. ip can be re-written as 

M p ( N ) ^iVi ' ' ' 8 imk pT Jhhi ■ ■ ■ S jm i Tm , (3.23) 

p,re6 m 

where we have shifted the index in the sum by setting p = ar' 1 and used the fact that 
M ffi7 -(JV) depends only on ar^ 1 . By multiplying equation (13.231) by A^j^, • • • , A^j m i m and 
AjVij^, . . . , A^i m k m and summing over all indices, we obtain an expression of the form 

M p( n ) J2 J2 " ' A NapA Nj pA NlS A~ Nri 5 ■ ■ ■ (3.24) 

pe&m re&m ...,a,(3,j,S,ri,... 

Consecutive indices in the inner sum, say (3 and 7, are of the type k pTV and l TW respec- 
tively, where w = r~ l prv. Thus, the collection of the addends such that v = (r _1 pr)"' v 
contributes with a factor Tr (AnA* n ) j . 

Each letter belonging to a cycle of length j is a fixed point of order j of every element 
in the conjugacy class of p. The inner sum in ( I3.24p depends only on powers of r _1 pr, and 
therefore is a class function and is independent of r. Each cycle of length j produces the 
factor Tr (AnA* n Y . Therefore, we have 

m 

■■■ A Na pA Nl pA NjS A NriS = ]J(Tr (A N A* N ) j ) rj = Px (A N A* N ) . (3.25) 

...,a,/3,7,<5,r?,... j=l 

Finally, formula (I3.22p follows from the fact that M p (N) is a class function. □ 

Remark 3.3. The integral ( I3.22p . and thus by ( I3.2ip the moments too, are linear combi- 
nations of the coefficients M\(N), which have poles at the zeros of f\(N). Such poles are 
related to certain singular integrals over U(N), which appear in lattice Quantum Chromo- 
dynamics and were first noted by De Wit and 't Hooft [8J . They observed that such integrals 
are divergent for certain values of N. The moments of X^, however, are always finite. 
The reason why the De Wit- 't Hooft anomalies do not affect formula ( I3.22p is because it is 
correct only for m < N , and by corollary 13.11 the greatest zero of f\(N) is m — 1. 

As we mentioned at the beginning of this section, in order to prove the Berry-Esseen 
inequality (12.161) we need bounds and asymptotic formulae for the moments and cumulants. 
The evaluation of the right-hand side of equation (I3.22p requires Frobenius's character 
formula, which is quite cumbersome to use when explicit formulae are needed. It turns out 
that (13.221) can be expressed in terms of Schur functions, which allow it to be manipulated 
explicitly. 

Corollary 3.4. We have 

f»» = ^ ~2yn l E [U j+xf-jy) dim ^A(A^). (3.26) 
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Proof. From formulae (13. 6p . (I3.13P and (13. 22p we obtain 

I%(A N ) := ml Y,9xM x (N)p x (A N A 



N) 
Ahm 

E E ynjfav&px (a n a* n ) (3 27) 

, \- dim 



Equation (I3.26P follows from formula (13.141) . □ 

We are now in a position to find asymptotic formulae for the first N moments of Xn- 
Let us denote the moments of Af(0, 1) by ^- e - 

^:=(2m-l)H. (3.28) 

Proposition 3.5. VKe have the following bounds: 

/i 2m = fif m (l + O {{mk) m N- 1 )) (3.29) 

and 

V2m < {N b k) m (3.30) 
Proof. From equations (I3.6P and (I3.22p we have 

^ 2m = (2a 2 ) m m\ E E JJn^ 9xX ^ Px ( AnA *^ ' ( 3 ' 31 ) 

The first step consists in finding bounds for p x (A N A* N ) and ffj,(N). Remember that by 
definition (12. 9p . the greatest singular value of A N is bounded by \/kN b , where k = 0(1) 
and < b < 1. We have 

Tr (A N A* N y = {2a 2 ) 3 N^c, (A N ) , (3.32) 

where Cj (An) = O (A;- 7 ) and a = 1 — b. Note that by definition o is independent of A/v, 
therefore c\ (An) — 1. It follows that 

m 

Pa(A^) = n(Tr(A^) J ) rj = (2a 2 ) m iV^ A > +6m c A (Ajv), (3.33) 

where 

m 

ca (Ajv) = J] Cj (A N ) r > = O (k m ) . (3.34) 
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Denote by A e = (l m ) the cycle-type of the identity in & m . Combining equations (I3.33|) 
and (I3.34p we obtain. 

p x (A N A* N )= (2a 2 ) m N m x j 1 , , „,.,. if A = Ae ' (3.35) 
Now consider /^(iV). We can easily see that for N > m + 1 

m m 

f {lm) (N) = ]J(N - j) < UN) < / (ml) (iV) =l[{N + j), fi h m, (3.36) 

i=i 3=1 

where (m 1 ) and (l m ) correspond to the trivial and alternating representations respectively, 
which are both one-dimensional. We can re-write the inequalities (I3.36P in the following 

way 

j'=i v 7 i=i v 7 

N m m\ , . , iV m (2m + 1) • • • (m + 2) 

< /„ W < 1 7 (3.37) 



(m + l) m - 7 " (m+ 1) 

The sum (13. 3 1 p can be split as follows: 

_ u, G N m ^ {drmV x f 
M2m ~ ~mT ±^ f^N) 



^° v^v^dimV^ 

+ E E j^^x (AjvAv) • 

v 7 Ahm uhm 7 

The first sum on the right-hand side can be estimated using the bounds (13.361) 

ml it &(*0 (3.39) 
= /i G (l + 0(e m A^- 1 )). 

Using the same ideas, we write 



(dfc E = (^fe E XS dim *(1 + O (^)). (3.40) 



Irreducible representations of finite groups can always be chosen to be unitary. Therefore, 
we have that |%^| < X\ = dimV^. Thus, using the orthogonality of the characters, the 
sum (I3.40p becomes 

» G R{m,N) 

(2a 2 ) m N m} { } 
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where R(m, N) = O (e m N~ 1 ). Finally, inserting equation (13.41 j) into (I3.38|) and using (I3.35P 
we obtain formula (I3.29P 

In order to prove equation (I3.30p . recall that from formulae (13.261) and (I3.14p we can 

f o 

t L 2m (A ' T / \2 S X 

^ m = (2n*) m m\ E (dim Vx) s,(T,A ■ (3 - 42) 



write 



(2aTm\^ K X > s x (I_ 

Ahm 

The greatest singular value of An is bounded by 2a 2 VkN b . Since the eigenvalues of A^A* N 
are non-negative, s\ (A N A* N ) is positive and 

sx(A n A* N ) < (kN b ) m s x (I N ). (3.43) 

Then, equation (I3.30p follows from the orthogonality of the characters. □ 

An immediate corollary is the convergence in distribution of to Af(0, 1) (D'Aristotile 
et al. [10J). For fixed m formula (I3.29P gives 

lim fi 2m = V2m- (3-44) 

Af->oo 

The bound (13.301) plays an important role in the proof of the Berry-Esseen inequal- 
ity (I2.16p . For N ^> m (I3.29P is a better bound; however, it becomes much worse when 
m ~ N. This is an important regime. As we shall see, when 6 = 1 the right-hand side 
of (I3.30P is too large to allow (I2.16P to be valid for a range of £ sufficiently large for our 
purposes. We believe that the correct bound for /z 2 m is much smaller than both (I3.29P 
and (I3.30p . The reason is that the sum 

E E jm 9>xSpx (A N A* N ) (3.45) 

Ahm fihm ^ 

is characterised by a sequence of cancellations. 

Remark 3.6. It is worth noting that, since the integral on the right-hand side of equa- 
tion (I3.18P is zero unless n = 2m, the proof of proposition 13.51 also demonstrates that the 
random variable Z^ = Xn + iYn = Tr A^U converges in distribution to a complex normal 
random variable Z , whose centred moment^ are 

E {Z m T} = 5 mn a 2m m\. (3.46) 



2 Here we have adopted the convention that the real and imaginary parts of a standard complex normal 
random variable have variance ^. Therefore, if we had studied Tr Aj^U, instead of its real and imaginary 
parts separately, we should have set a 2 = Tr A^A* N /N. This explains the discrepancy of a factor 2 m in 
the notation used in equations p.47[) and (|3.46p . 
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Remark 3.7. When An = I the first N moment are exactly gaussian independently of N. 
This is a particular case of a more general result proved by Diaconis and Shahshahani [9] 
and can be easily recovered in our formalism. We have 



Ahm 

(^TEEfm¥ iN ' w (3-47) 

Ahm /uhm JlX \ ' ^ X <= 

(2^ 2 ) m E^X=- ! (2- 2 ) m , m<N. 



3.3 The cumulants 

The characteristic function i^>n{0 is entire and by definition ip^i®) — 1- Therefore, the 
Taylor series 

n! 



< • n I 



n=l 



converges in a neighbourhood of the origin. The coefficients K n are by definition the 
cumulants of and determine uniquely its probability distribution. They are related to 
the moments by the recurrence relation 

Kn = - ^1 (? 1 j KkVn-k- (3-49) 

fc=l ^ ' 

The choice of whether to use the moments or the cumulants depends on the information 
that one is seeking to extract. It turns out that in the proof of the Berry-Esseen inequal- 
ity (I2.16P we shall need the asymptotic behaviour of both. The purpose of this section is 
to derive a bound for /t 2m for m < N . 

Let A h n and define 

n 

where the rjs are the frequencies of the partition A. There exists an elegant formula (see, 
e.g. [IS] , pp. 30-31) that expresses the moments as polynomials in the cumulants: 

/i n = ^c A KA, (3.51) 

Ahn 



where 

n\ 

(l\) ri ri \~(m\y m rj. 



c a : = JT^rTZ , TZ^TZ , ( 3 - 52 ) 
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is the number of decompositions of a set of n elements into disjoint subsets containing 
Ai, . . . , A n elements. Similarly, equation (I3.49p can be solved for the cumulants: 

n n = (-l) i(A)_1 (t (A) - l)!c A /i A , (3.53) 

Ahn 

where 

n 

fi x := ^■■■ f i Xi = Y[fi r /. (3.54) 

i=i 

All the odd moments of are zero, therefore all the odd cumulants are zero too. 
Thus, f)3.5ip can be rewritten as 



A*2r, 



^-^"^ (ir-^-D-r - (3 ' 55) 



where we have used the notation 2A = (2Ai, . . . , 2A;) . 

The 2m-th moment of is a polynomial of degree m in the traces Tr [A^A* N y \ the 
recursion relations (I3.49P imply that the 2m-th cumulant is also a polynomial of degree m 
in the same variables. Therefore, we can write 

K 2m = (2 ^~i )H Yl 9xK x (N) Px (A N A* N ) . (3.56) 

^ ' Ahm 

If we know the asymptotic behaviour of K X (N), then we can determine that of the cumu- 
lants. In turn, the coefficients K X (N) are related to those of M X (N). 

The union A U p is defined as the partition whose parts are those of A and p arranged 
in descending order. Cumulants have a combinatorial interpretation in term of partitions 
of sets; let us define 

M x (N):=J2 a *TlK( N l ( 3 - 57 ) 

A ueA 

where A runs through all possible distinct decompositions of A as a union of sub-partitions. 
The meaning of a& is better explained with an example. Consider the partition A = 
(Ai, A 2 , A3) and write 

M (Ai,A 2 ,A 3 ) = ^(Ai,A 2 ,A3) + ^(Ai)^(A 2 ,A 3 ) + ^(A 2 )^(Ai,A 3 ) ^ ^ 

+ ^(A 3 )^(Ai,A 2 ) + K {\i) K [\i) K {XaY 

If the Ax, A2 and A3 are all different, then each summand in (I3.58P is distinct, but if 
some parts of A are repeated, this is not the case. For example, let A = (3,1,1), then 
(A2) U (Ai, A3) and (A3) U (Ai, A 2 ) are the same decomposition of A and 
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The coefficient oa is precisely such a multiplicity. Computing it is an exercise in elementary 
combinatorics. 

Let A h m and define 71^ to be the number of times that a partition u, appears in the 
decomposition A = |J M eA f 1 - Furthermore, let rj and denote the frequencies of j in A 
and [A respectively. We have 



3=1 MGA S 5 !7r ^ ! 



Proposition 3.8. The coefficients K\(N) andK' x (N), defined in equations f!3.56j) and (13.571) 

respectively, coincide. 

Proof. For the sake of simplicity, let us set Xj := Tr (AnA* n Y and X\ := ■ ■ ■ x r ™ . By 
inserting equation (I3.56P into the right-hand side of (I3.55P we see that 

Ahm A neA 

Similarly, by substituting (I3.57P into (I3.22p we obtain 

Ahm A fieA 

Since the right-hand sides of equations (I3.6ip and (I3.62p identically equal for arbitrary /i, 
we need to show that 6a = b' A 

Equations (13.551) and (I3.56P give 



(2m-l)!!v^ A /v- T ^^ T , 
/i2m= (2^) m 2^ C ^[[[1^^K^N) 

^ ' Ahm j=l \n\-j 



= £ (3.63) 

v ' Ahm j=l fthj M V ' 

£*>= r j 

(2m-l)!!m!^ 1 >^ -j-r ^ 

V ' Ahm K ' KJ ' A [iEA V 

In the last passage ir^ assumes the same meaning as in equation (I3.60p . i.e. it is the number 
of repetitions of a partition /i in the union A = IJ^ga/ 1 - Now, let A = (l ri . . . m r ' m ) and 
/i = (l s i . . . j s j ' ) with Ahm and /i h j. The frequencies of A and /i are related by 



r fc = $>£. (3-64) 

mga 



Furthermore, by definition we have 
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Thus, combining equations ( I3.63p . (I3.64p and (I3.65j) we arrive at 

^FrT^rEIinp^n^W^' (3.66) 



(2m)! 



Finally, equations ( EOT]) . fl3T22|) . fl337jl and CTT]) give 

= e ft 2 ^ e fi n n cw- < 3 - 67 ) 

□ 

Brouwer and Beenaker [3] computed the leading order asymptotics as N — > oo of K\(N). 
By inserting the right-hand side of (13.571) in equations (13. 9p we derive the recursion relations 

fc-i 

NK (Xl ,..., Xk) (N) + £ A\ A , a,,,,, (iV) + )T A ; A :A: A , 

i,Aj+Aj.,Aj-|-i,...,Afc_ 1 ) 

(iV) 

p+q=X k j=l 
k-1 . 

+ E E ? -if fc _ ? -_i)i E ^M,...^ P )(^)K W , + i),..., CT ( fc -i), 9 )(iV) = (3.68) 

with Kq (N) = 1. The solution to these equations to leading order is 

K X (N) = (_i) m +^ A ) 2 ^ A )jV- 2m -^ A )+ 2 

(2m + £(A)-3)!fr ((2j-l)!) rj , ^ r _ MAn (3.69) 



(2m)! IX ((J _ 

We are now in a position to state the main result of this section. 
Theorem 3.9. We have 

K 2m = O ((2m)\N^ 2 ~ b ^ m ^) . (3.70) 
Proof. This bound follows simply by combining equations (I3.69p . ( I3.56P and (I3.35p . □ 

4 Proof of the Berry-Esseen inequality 

In order to prove the Berry-Esseen bound (12.161) . we need an estimate of the radius of 
convergence of the cumulant expansion (I3.48p . 

Lemma 4.1. There exists a constant 5 > such that iPn(0 > for < |£| < 5N~ . 
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Proof. Since i^n(0 is entire, the radius of convergence of the Taylor series of log V>iv(£) is 
given by the location of the nearest zero to the origin of iPn(0 ■ 
By definition 

PCX) 

\Mt)\ < <M0) = / f N (x)dx = 1. (4.1) 



Suppose that ■*/>(£) has real zeros and let £ be the closest to the origin. Since V'aKO is even, 
we can assume that £ is positive. For |£| < £, < iPn(0 < 1, therefore the Taylor series of 
log^iv(0 i s convergent in (—£,£). Thus, it also converges in a circle centred at the origin 
and of radius £. In other words, there are not any complex zero of ip whose distance from 
the origin is less than £. Therefore, in the rest of this proof we can take £ to be real and 
positive. 

A general formula (see [12], p. 514) for moment generating functions gives 



fc-i 



3=0 

Let us consider the two sums 



< (f^ 2 , (4-2) 



:= ^(-1)^^, (4-3a) 
V2s(0 := E(-1) J t£,^- (4-3b) 

3=0 1 

Since the Taylor expansion of ?A/v(£) is an alternating series, equation (14. 2 p implies 

T>2r-l(£) < <M0 < V 2s (0 (4.4) 

for any pair of integers r > 1 and s > 0. By definition \i 2 = 1, thus the lemma is trivially 
true for £ 2 < 2. 
Let us write 

exp (-w£ 2 /2) = twar-i (0 + u 2r (0 , (4.5) 

where 



2r-l 

(2j) 

i=o v J; 



w 2r ^(0 : = « 4, (4.6a) 



«*(*) == £ ("l)*" T^g- (4.6b) 

Recall that //^ = (2j — 1)!! denotes the moments of A/"(0, 1). We choose w > 4e 2 and 
independent of r. We now want to show that for r < N there exists an appropriate u such 
that 

0<W 2r -l(O <«2r-l(0 (4-7) 
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in the interval 

r 2e(r-l) 1/2 2er 1 / 2 



Since uC, 2 < 4e 2 r the summands in the reminder (14.6bj) are strictly decreasing. There- 
fore, we can write 

«;ar-i(0 = exp (-ue/2) - £ (-1)' ^jig 

i=2r ^ ^ ( 4 9 ) 

> exp (-u;£ 2 /2) - -^fA** > 0- 

The last passage is a straightforward consequence of Stirling's formula. 

Now, both u>2r-i(0 and v 2r -i(£) are alternating sums. Therefore, w 2r -i(0 < ^2r-i(0 

if 

G G 

^2(2j) OjMj _ /i 2(2j+l) oj+l t4j+2 < A t 2(2j) ^ _ /^2(2j+l) ^4^+2 1 ^ 

(4j)! * "(4j + 2)! W 5 "(4i)! 5 (4j + 2)!^ 
for j < r — 1 . This equation can be rearranged as follows 

( 2j fa -m)< — ^- ( 23+1 L« +i - ^m) . (4.11) 

V 2 J { )-{2j + l)\2j { J ^ ) 

If we choose u> > N b k, this inequality holds for r < N because of proposition 13.51 and 

1 /2 

equation (13. 29p . Thus, the statement of the lemma follows if we set 5 = 2e/ (N b u) . □ 

We are now in a position to prove theorem 12.31 From theorem I3.9[ we know that for 
m < N 

n 2m = O ((2m)\N-^ m -^) . (4.12) 

Furthermore, from formulae f 1 3 . 2 1 j) and (I3.49P it is straightforward to compute the first 
few cumulants. We have 

k 2 = 1, (4.13a) 

_ 3 Tr(A N A* N -2vI N ) 2 

4a 2 iV3(l-l/iV2) ' {A - Ub) 

15 Tr (A N A* N -2aI N f 
6 2iy%3(l-l/iV 2 )(l-4/iV 2 )- K } 

Since the cumulant expansion converges up to £ < SN^t 1 ^, there exists a parameter 
9 such that 

log^(0 = -f +0^- (4-14) 
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It turns out that 9 = 0(1) as N — > oo. Now, recall that the moment generating function 
of 7V(0, 1) is ^(0 = e~* 12 . Therefore, we can write 

\1>n (0 - ^(01 = \e^ N(2 - b) - l| < J^ e ^-\-e/*, (4.15) 

where we have used the inequality \e z — 1| < \z\ e' z L The exponential e e ^l N2 b is bounded 
in N provided 9 = 0(1). Therefore, the right-hand side of (I4.15P becomes 

\^N(0-m\<^tr b e- e/ \ (4-16) 

where C can be chosen independent of N. 

To complete the proof of equation (I4.16p . we need to show that if £ < 5N2^~ b \ then 
9 = 0(1). Let us write the cumulant expansion as 

logMO = -j + -f- + MN), (4.17) 

where 

oo elm 



m=3 



If a series Y2m=i c ™ converges , then c m — Y as m — > oo. Therefore, for m > N we must 
have. 

K 2m = o((2m)\(5 2 Ny m{1 ~ b) y m^oo. (4.19) 

Thus, combining equations (I4.12p and (|4.19p . the reminder (I4.18P can be bound by the 
series 

V — + 2 ^ - V ^ U 20) 

m=0 m=0 ' 

where Ci and C 2 are constants and 5* > S. For £ < ./V^ 1-6 ) this sum is 0(1) as iV — > oo, 
which implies that 9 cannot be an increasing function of N. 

Remark 4.2. There is striking difference between the superexponential rate of convergence 
discovered by Johansson [15J when A N is the identity and the rates of Theorem \2.U Indeed, 
superexponential rates of convergence to central limit theorems are unusual in probability 
theory. Theorem 12.31 provides some insight into this. When An = In the first N mo- 
ments of Xn are gaussian (see equation (I3.47P ) and its first N cumulants but k 2 are zero. 
Therefore, equation (14.161) turns into 

fic2{N+l) 

\M0-m\<^w^ e ~ e/2 - ( 4 - 21 ) 



22 



J. P. Keating, F. Mezzadri and B. Singphu 



5 Proof of theorem 12.1 



5.1 Preliminaries 

Let us set S N = SN^/ 2 and T N = N^, where 7 > 2. Theorem 12.31 allows us to split the 
right-hand side of (I2.14p as follows: 

0(~* f&N O pN 1 r\ /•TV 7 



N 2 6 7T J 7rS N J Sn 7rS N J Sn 

24 " 

+ 



The upper limits of integration can be replaced by infinity. The first integral gives the 
desired bound. We need to show the remaining terms are of lower order. 
The second integral in equation (15. ip can be rewritten as 

.■>.-. I < ■ I / ■ . — \ 

(5.2) 



/ e -e/2 di < i /Eerie (5N^ 2 /V2 



where 



2 ro ° 



erfc(t) -=^J e~ x2 dx (5.3) 

is the complementary error function. Since erfc(t) satisfies the inequalities (see, e.g. [1], p. 
298) 



< e* 2 / e^df < 7 , (5.4) 



t + J t t+ A /W 



7T 



the second integral in (15. ip can be neglected. 

The last task that we are left with is to estimate the integral 



\MZ)\<% = 0(N- 2 ). (5.5) 



5.2 Regularity properties of the distribution of 

In general we do not have an explicit formula for iPn(0 in the interval (5N^~ b ^ 2 , iV 7 ). 
Thus, in order to estimate its behaviour in this range we need to adopt an indirect ap- 
proach. The idea is to approximate Xn with a random variable X* N whose characteristic 
function allows us to control the third integral in equation (15.11) . Then, we will estimate 
the difference between e(N) and 

e*(N) := sup|F^(x) - , (5.6) 

where is the approximate distribution function of X* N and $* is the distribution function 
of a random variable close to jV(0, 1) (in a sense that will be made precise later). 
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We first need to discuss some regularity properties of the probability distribution of 

Lemma 5.1. If N > 1 the distribution function Fn is absolutely continuous, it admits the 
integral representation 

F N {x) = [ f N {t)dt, (5.7) 



where /at G L l (R), is bounded and uniformly continuous. Furthermore, iPn{0 — o(£ 1 ) 
as £ — > oo. 

Proof. Denote by Q(N) the maximal torus of U(N), i.e. the group of diagonal unitary 
matrices 



diag (e i6l ,...,e i6N ) 



(5.f 



y e i0JV y 



and write W (iV) = XJ(N)/Q(N). An explicit expression for Haar measure on U(iV) is 

d^u (U) = Yl sin2 ( fc 2 ^ J • • • d6 N dfi Wj (5.9) 

^ ' ' l<j<k<N ^ ' 

where cf//\y is a normalized Borel measure on W (N). 
Now, recall that 

Xn = ai cos^i + • • ■ + cos 9m, (5.10) 

where ai, . . . , ajv are the diagonal elements of A ^. Thus, we can integrate out dfi-w and 
study the measure 

nN(N-l)/2 / q 

= (2 Njvjy-j II sin2 ( % " ) d9 ^'" d6 ff- ( 5 ' n ) 
^ ' ' i<j<k<N ^ ' 

Since X^r is an absolutely continuous function of 9i,...,9^, if D C [0,27?)^ is a set 
whose image Xn(D) has Lebesgue measure zero, then D must have zero measure too. It 
follows from equation ( 15. lip that P {X^ G B} = for any set B of Lebesgue measure 
zero. Therefore the probability distribution of Xn is absolutely continuous. Since the only 
absolutely continuous measures on K. are only those that have a density, F N admits the 
integral representation (15. 7p . 

We can say more about f^- The measure e l ^ XN dfi@ is a differential form on the N- 
dimensional torus. Let 

aw = — max |Tr AnU\ = a\ + ■ ■ ■ + a^. (5.12) 

o ueV(N) 



3 In section [5751 the ability of estimating a bound for |/at(x) — <j>{x)\ will be essential. Even though U(iV) 
is compact and Haar measure is absolutely continuous with respect to the Lebesgue measure on 1^ , it 
is far from obvious that /jy is bounded or even continuous in all R. For example, lemma [57T1 is false for 

N = 1. Indeed, a direct calculation gives fi(x) = (l — (x/a) 2 ) ^ 2 . 
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For any neighborhood of Xn £ [— aN,aN] we can find a local change of variables that 
allows us to write 

e iiXN du, e = e^ XN dX N A u>, (5.13) 

where u is (N — l)-form on Q(N), the symbol A denotes the exterior product and the 
roman d' indicates exterior differentiationH For example, we can choose 

xn = cl\ cos#i + a 2 cos 2 + • • • + ajy cos 9n 
ip 2 = ai sin (9 1 + 0) + a 2 sin (0 2 - ff) 

^3 = 3 (5.14) 

ifN = On, 

where is a real parameter. The Jacobian of this transformation is 

J(x N ,(/)j) = ^} XN,<P3 ) v = aia 2 cos/3sin(6» 2 (x N ,<fj) - 6>i (x N ,(pj)) 

O {Vi, . . . , UN) 

+ aiQ2 2 Sin/3 cos^, 0,) + 2 (x^, 0,)) (5-15) 

a x a 2 sin /3 , N 
H ^ COS(6'i(XAr,(/) j ) - 9 2 {x N ,(p j )). 

Thus, the map f)5.14p is invertible everywhere except, perhaps, on a surface 1 = f(0 2 ;f3) 
where the Jacobian is zero. Appropriate choices of the parameter in different regions of 
Q(N) allow to define the differential form u everywhere in Q(N). More explicitly, we have 

2 7V(iV-i)/2 . 2 f9 k (x,(f) j )-9 i (x,(f) j )\ ^ 

- - wrm^j J^-' ( —2 — ) ^ A ' ' ' A (5 - 16) 

Let if> an d X b e two differential forms of degrees p and g respectively. The exterior 
derivative of if) A x is a (p + 9 + l)-form given by 

d(i/jA X ) = d^Ax+(-l) P ^Ad X . (5.17) 

Now, e^ XiV is a 0-form on 0(iV). Away from the region where J(x, (f>j) = the inverse 
of the map (15.141) is differentiable with continuous derivatives. Therefore, we have 

— -co) ~^ du - ( 5 ' 18 ) 

One can easily verify by direct calculation that u is not closed, i.e. du ^ 0. 



4 While an absolute continuous measure dfi on a smooth manifold can always be interpreted as a dif- 
ferential form, it does not mean that it is the exterior derivative of another form. Indeed, dfj,® is not. We 
use the notation 'd' to emphasise this difference, because it is important in what follows. 
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If -0 is a differential form of degree p and f2 is a manifold of dimension p + 1, then 
Stokes' theorem states that 



cty> = / (5.19) 
Jan 

where dfl denotes the boundary of Q. An N- dimensional torus is a compact manifold with- 
out boundary, therefore the right-hand side of (I5.19P is zero. As a consequence, integrating 
both sides of equation ( 15. 18ft we obtain 

V>at(0= / e^ XN (dX N Acu) = [ e^ XN dco. (5.20) 
Je(N) z £ Je(Af) 

It follows from the Riemann-Lebesgue lemma that iPn{0 — o(^ _1 ) as £ — > oo and is 
integrable. Thus, the inverse Fourier transform 

1 f 00 

f N (x) = — / e-^iv(e)^ (5-21) 



2vr „ 

is well defined, bounded and uniformly continuous. □ 



5.3 Smoothing 

From the discussion in §5.11 it follows that a necessary (not sufficient) condition for 
Is 1^(01^ to decay fast enough is t/>jv(£) — O (£~ 4 ) as £ — > oo. Indeed, if An is 
of full rank and its spectrum is not degenerate, equation (I2.8I) and the asymptotic formula 

Jk{ x ) ~ y — cos [ x — \k7r — ^tt) , x — y oo (5.22) 

imply that ip^ (0 = 0(£ _Ar2 / 2 ). Therefore, /at has continuous derivatives at least up to 
order N 2 /2 — 2. In other words, Jn becomes increasingly smooth as N grows. 

If a function has continuous derivatives of order p, then its Fourier transform is o (£~ p ) 
as £ — > oo. This suggests smoothing F N with an appropriate test function. More precisely, 
we define 

poo 

F N (x) := [F N * X e] ix) = / F N (t) X e(x - t)dt, (5.23) 



where Xe £ QT (^) an< ^ * s normalized to one. Our choice will be the test function 



. — exp ( j— 2 ) if x G (— e, e), . 

Xt (x):={^ P V Hi) ) 1 ' (5.24) 

if x e R\ (-e,e), 



where 



-i 



1 



1 -x' 



g := / exp dx = 0.44399 . . . (5.25) 
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By differentiating and integrating by parts we obtain 



f* N (x) ■= F%(X) = [f N * X e] (X) = 

The convolution is positive and 

fx(x)dx 



fN{t)Xe{x ~ t)dt. 



(5.26) 



In — 
In addition, e C£° (R) too and 

r N (0 := 

where 



Xe (x) cfx ) = 1. 



(5.27) 



e^r N {x)dx = MOXe (0 



X e (0 := / e^ x Xe (x) dx. 



Let us introduce 



A(x) := F^x) - $(x), A*(x) := [A * X e] (x). 



Then, we write 



e(N) = max | A(x) 



e*(N) = max|A*(x) 



(5.28) 



(5.29) 



(5.30) 



(5.31) 



Formula (I2.14p still holds if we replace Vjv(0 an d ^(0 with ^at(£) an d ^*(0 
V'COXe (0 respectively. Indeed, let = 5N^~ b ^ 2 and TV = iV 7 with 7 > 2. We have 



e*(A0 < - 

7T 



+ 



d£ 



irS 



Wn (01 # 



A'"" 



IV>*(0l^ + 



N JS N 



24m 



(5.32) 



where 



Now, 



m := max \<p * xA 



(5.33) 



(5.34) 



where we have used |x e | < 1, which holds for any characteristic function. Therefore the 
Berry-Esseen inequality (I2.16P applies to \ip* N — too and 



s N 



d£ = (N 2 - b ) . 



(5.35) 
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Equation ( 15. 2 p gives 

-Ni r N~l 



\ W (0 1 # < / < g V o erfc (^ (1 " b)/ 7V2) . (5.36) 

J Sn J Sn V 

In order to complete the proof of equation (I2.12p . we need to show that, for appropriate 
choices of the smoothing parameter e, e(N) < Ce*(N), and that the integral 

N1 /-AH 

1^(01^= / hM0ll*e(0ld£ ( 5 - 37 ) 



is sufficiently small. The appropriate choice of e for which these two statements are true 
is a delicate balance. As e decreases e*(iV) will approach e(N). However, if the support of 
Xe is too small, its Fourier transform might spread for a range of £ > Sn large enough to 
prevent the integral (15.371) from decaying at a sufficiently fast rate. 

The leading order asymptotics of Xe(0 can be computed using the method of steepest 
descent. We report the calculation in the appendix. We have 



*««» = ^V71 COS(£f - (£f)I/2 -^ (5 , 8) 



x exp (-(ef) 1/2 - J) (l + O (M)- 1 ' 2 )), £? - x 



For this approximation to be meaningful £ > 1/e. Therefore, we cannot choose e < C/Spj, 
otherwise the bound on the decay rate of the integral (15.371) would not be adequate. 

It remains to establish if e = O (S 1 ^ 1 ) leads to a good enough approximation to e(N). 
In order not to loose information on the behaviour of A(x), the smoothing parameter 
needs to be comparable with the rate of oscillation of A(x). In other words, we need a 
bound on |A'(x)|. Such a bound can be obtained, once again, using the Berry-Esseen 
inequality (12.161) : 



/Sn 

POO POO 

+ 2 \MZ)\<% + 2 mo\dc- 



(5.39) 



By comparing this inequality with (15. ip . we see that the first and third integral are 
O (iV~( 2-b )); the second integral might possibly be bigger by a factor tcSn- It follows 
that 

\A'(x)\ < CS N e(N). (5.40) 



This is sufficient for our purposes. The following lemma completes the proof of theorem 12. 11 

Lemma 5.2. Suppose that |A'(x)| < e(N)/r)N- Then, for e = tjn there exists a positive 
constant C = 0(1) such that e(N) < Ce*{N). 
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Proof. Since A(t) is continuous and —1 < A(t) < 1, there exists & t £ M. such that 
e(N) = A (to)- Then, we have 

A(t + V) > e(N) f 1 - — ) , for y > 0. (5.41) 

Now set 

x = t + rr} N , t = rn N - y, (5.42) 

where < r < 1 is a parameter whose exact value is to be determined. Equation ( I5.4ip 
becomes 

A (x - t) > e(N) 1 1 - r H J , for \t\ < rn N . (5.43) 

V VnJ 

Substituting this bound into the definition 

/oo 

A{x-t) Xm {t) dt ( 5 - 44 ) 
-oo 

we arrive at 

e* (AT) > A*(x) > e(N) (1 - r) f X m (t)dt - e(N) [ X m (t)dt, (5.45) 

where we have used the inequality A(t) > —e(N) > — 1 and the fact that the linear term 
in ( I5.43P does not contribute because Xm (*) i s even- 
Now, it turns out that the two integrals in (I5.45P are independent of r] N . Indeed, 

/ Xm {t)dt = - [ exp(--^—)dt. (5.46) 

J\t\>r VN 9 Jr V 1 — 1 J 

Define 

h(r) := r + - (2 - r) J exp (- <f*. (5.47) 

Equation ( I5.45P can now be rewritten 

e*(iV)>e(JV)(l-/i(r)). (5.48) 

Since h(r) has a minimum near 2/3 and ft, (2/3) = 0.77646.. ~ 4/5, the statement of the 

lemma follows with C = 5. □ 

We achieve the rate of convergence in equation (12.121) if we set 
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6 Proof of theorem 12.2 



We need to prove equation (12.131) . 

Let us choose the parameter rj^ in lemma I5T21 to be 

UN = iV" c , 

where < C < |(1 - b). Equation (15T39]) gives 

\f N (x)-4>(x)\<C c N- 2+b ^ 

for some constant Cf. 

Now take (' > and write 

pN<' 

\f N (x) - <j>(x)\dx < 2 / \f N (x) - <f>(x)\dx 



We have 



POO POO 

+ 2 / f N (x)dx + 2 (j)(x)dx 

poo POO 

< C fif ,iV- 2+b+c+c ' + 2 / f N (x)dx + 2 

Jn<' Jnc' 



oo p—N^ poo 

2 / fN(x)dx = / f N (x)dx + / f N (x)dx 
nc' J -oo Jn<' 



4>(x)dx. 



F N (-^ C ') +1-Fn (n C 



Similarly 



oo p—N 1 * poo 

2 / (j){x)dx = I 4>{x)dx + / 4>{x)dx 
n(' J-oo Jnc' 



Therefore, rearranging the terms and using the identity 1 — = $(— x) we obt 

2 J ? M^cta + 2 y ? <f>{x)dx = (^F N (~^ c ') - $ (--^') J 

- (f* (M') - $ (M')^j +4^1 - $ (M')^j 
Finally, equation (gHHJ) follows from (gZTJ) and O by setting e = ( + C- 
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Appendix. The leading order asymptotics of % e (£) 

The purpose of this appendix is to compute an explicit formula for the leading order 
asymptotics of 

Xe (0 : = / e^ x X e(x)dx = — f exp [ i& ^ J dx (A.7) 

J-00 9£ J-e \ 1 - (2) / 

in the limit £ — >• oo. Since Xe(£) — Xi ( e £)> f° r t ne s& ke of simplicity we set e = 1 and write 
X — Xi- More explicitly, we study the integral 



X (0 = ~ Re jf exp fz£x - ^3^2 ) ( A - 8 ) 



Proposition. VFe /iai>e 



xexp(-e 1/2 -i)(l + 0(r 1/2 )), £^oo. 



(A.9) 



Proof. (1A.8|) can be estimated using the method of steepest descents. The integrand is not 
analytic at one, so we look at 



2 rl - s 



-Re / exp I i£x — ] dx, (A.10) 

9 Jo V 1 — x J 

where S > is small. The difference between ( ]A.8j) and (1A. 10h is bounded by 2Se~ 1 ^ 25 ^ /g. 

Since we are interested only in the real part of flA.lOj) . for large f the origin will not 
contribute to leading order; the main contribution should come from a small neighbourhood 
near one. 

Consider the argument of the exponential: 

f(x) :=ifr-—^. (A.ll) 

1 — x z 

Its saddle points are the solutions of the equation 

if (1 - x 2 ) 2 -2x = 0. (A.12) 
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For large £ the roots of this polynomial can be computed perturbatively in the parameter 
l/£. In other words, we look for a solution near one with an asymptotic expansion of the 
form 

(A.13) 



x 



where a is a rational power. By substituting this expression into (1A.12[) . one finds that the 
two sides of the equation can be balanced only if a = 1/2 and that the first two coefficients 
are 



Xi 



V2 



and X2 = 0. 



(A. 14) 



Im(a; 




Figure 1: The deformation of the interval [0, 1 — 5] and the saddle points of /( 



x) 



Provided 5 is sufficiently small, the interval of integration of (lA.lOj) can be deformed 
into a contour asymptotically equivalent to the steepest descent path passing through 



x 



(«) = 1 + 7ff^ + ° (r3/2) ' {_> 



oo. 



(See figured)) Such a deformation is not possible for the critical point 



(A.15) 



Trivial algebra gives 



1 



f(m) = i (e - e /2 ) - e /2 -- A + o (r 1/2 ) , e ^ oo, 

r(x(0)=2v / 2e-^ 3 / 2 + 0(e /2 ), £^oo. 
The tangent to the steepest descent path at x has equation 

x(t) = x{£)+te-l ni . 



(A.16) 

(A. 17a) 
(A. 17b) 

(A.18) 
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Therefore, we have 




dx 

(A.19) 



exp^/(ar(f))J, £ oo. 



Finally, by inserting equations f lA. 17|) into (IA. 19[) we arrive at flA.9j) . provided 5 < £ ^ and 
/9 > 3/4. 

□ 
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